# Self-supervised Vision Transformer

Vit Huge Patch14 224.mae

A large-scale image feature extraction model based on the Vision Transformer (ViT), pre-trained on the ImageNet-1k dataset with the self-supervised masked autoencoder (MAE) method.

Image Classification

Vit Small Patch8 224.dino

A self-supervised image feature extraction model based on the Vision Transformer (ViT), trained with the DINO method.

Image Classification

Vit Base Patch8 224.dino

A Vision Transformer (ViT) image feature model trained with the self-supervised DINO method, suitable for image classification and feature extraction tasks.

Image Classification

Beit Large Patch16 224 Pt22k

BEiT is a self-supervised model based on the Vision Transformer (ViT), pre-trained on the ImageNet-21k dataset (also called ImageNet-22k) for image classification tasks.

Image Classification

Beit Large Patch16 224 Pt22k Ft22k

BEiT is a Vision Transformer (ViT)-based image classification model, pre-trained in a self-supervised manner on ImageNet-22k and fine-tuned on the same dataset.

Image Classification

Beit Base Patch16 224 Pt22k

BEiT is a Vision Transformer (ViT)-based model, pre-trained in a self-supervised manner on the ImageNet-21k dataset (also called ImageNet-22k) for image classification tasks.

Image Classification

Beit Base Patch16 224 Pt22k Ft22k

BEiT is a Vision Transformer (ViT)-based image classification model, pre-trained in a self-supervised manner on ImageNet-22k and fine-tuned on the same dataset.

Image Classification

A Vision Transformer (ViT) model pre-trained on the ImageNet-1k dataset with the self-supervised DINO method.

Image Classification
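The model names above appear to follow a common convention in which "PatchN" gives the patch size in pixels and the number after it gives the square input resolution. Under that assumption, the short sketch below estimates how many image patches each model processes. The helper `patch_grid` is hypothetical and written only for illustration; it is not part of any library, and it ignores extra tokens such as a class token.

```python
import re


def patch_grid(model_name: str) -> tuple[int, int]:
    """Return (patches per side, total patches) parsed from a model name.

    Assumes the name contains 'Patch<size>' followed by the input
    resolution, e.g. 'Vit Base Patch8 224.dino'.
    """
    match = re.search(r"patch(\d+)\D+(\d+)", model_name, flags=re.IGNORECASE)
    if match is None:
        raise ValueError(f"no patch size and resolution found in {model_name!r}")
    patch_size, image_size = int(match.group(1)), int(match.group(2))
    per_side = image_size // patch_size  # patches along one image edge
    return per_side, per_side * per_side


if __name__ == "__main__":
    for name in (
        "Vit Huge Patch14 224.mae",
        "Vit Small Patch8 224.dino",
        "Beit Large Patch16 224 Pt22k",
    ):
        print(name, patch_grid(name))
```

A smaller patch size at the same resolution yields more patches per image, so the Patch8 DINO models handle longer token sequences than the Patch16 BEiT models.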

© 2025 AIbase